AITopics | adversarial bandit

On Optimal Robustness to Adversarial Corruption in Online Decision Problems

Neural Information Processing Systems | Apr-25-2026, 13:15:29 GMT

This paper considers two fundamental sequential decision-making problems: the problem of prediction with expert advice and the multi-armed bandit problem. We focus on stochastic regimes in which an adversary may corrupt losses, and we investigate what level of robustness can be achieved against adversarial corruption. The main contribution of this paper is to show that optimal robustness can be expressed by a square-root dependency on the amount of corruption.

artificial intelligence, data mining, machine learning, (15 more...)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Online Lazy Gradient Descent is Universal on Strongly Convex Domains

Neural Information Processing Systems | Apr-25-2026, 07:56:36 GMT

We study Online Lazy Gradient Descent for optimisation on a strongly convex domain. The algorithm is known to achieve $O(\sqrt{N})$ regret against adversarial opponents; here we show it is universal in the sense that it also achieves $O(\log N)$ expected regret against i.i.d. opponents. This improves upon the more complex meta-algorithm of Huang et al. [20], which only gets $O(\sqrt{N \log N})$ and $O(\log N)$ bounds. In addition we show that, unlike for the simplex, order bounds for pseudo-regret and expected regret are equivalent for strongly convex domains.
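
As a rough illustration of the lazy update described in the abstract above, the following Python sketch plays linear costs on the Euclidean unit ball, which is one example of a strongly convex domain. The function names, the choice of the unit ball, and the 1/sqrt(t) step size are assumptions made for this sketch and are not taken from the paper.

```python
import math


def project_to_unit_ball(v):
    """Euclidean projection onto the unit ball (assumed domain for this sketch)."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm <= 1.0:
        return list(v)
    return [x / norm for x in v]


def lazy_gradient_descent(cost_vectors):
    """Return the action played in each round against linear costs.

    The "lazy" variant projects the scaled negative SUM of all past cost
    vectors, rather than taking a step from the previous projected action.
    """
    dim = len(cost_vectors[0])
    cumulative = [0.0] * dim  # sum of cost vectors seen so far
    actions = []
    for t, cost in enumerate(cost_vectors, start=1):
        eta = 1.0 / math.sqrt(t)  # assumed step-size schedule
        action = project_to_unit_ball([-eta * c for c in cumulative])
        actions.append(action)
        # The cost is revealed only after the action has been played.
        cumulative = [s + c for s, c in zip(cumulative, cost)]
    return actions
```

Against a constant cost vector the played action moves to the boundary of the ball in the direction opposite to the cost and stays there, which is the behaviour one would hope for against an i.i.d. opponent.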

algorithm, artificial intelligence, machine learning, (14 more...)

Country:

Europe > Austria (0.28)
Europe > Ireland > Leinster > County Dublin > Dublin (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.64)

0c8d3770cbb759430f4f4679abe3ab80-Paper-Conference.pdf

Neural Information Processing Systems | Apr-24-2026, 13:09:55 GMT

artificial intelligence, data mining, machine learning, (19 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Data Science > Data Mining > Big Data (0.47)

e36da7acd188c6655792270b38830124-Paper-Conference.pdf

Neural Information Processing Systems | Feb-12-2026, 11:16:18 GMT

feedback graph, graph, independence number, (13 more...)

Country:

Europe > Italy > Lombardy > Milan (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry: Education > Educational Setting > Online (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.46)

e655c7716a4b3ea67f48c6322fc42ed6-Supplemental.pdf

Neural Information Processing Systems | Feb-10-2026, 21:38:02 GMT

algorithm, attacker, corruption, (14 more...)

Country:

Europe > Denmark > Capital Region > Copenhagen (0.04)
Asia > China > Hong Kong (0.04)
North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
North America > Canada (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.71)

6590cb829f5ffef50050f3e5845fbb4c-Paper-Conference.pdf

Neural Information Processing Systems | Feb-9-2026, 11:29:49 GMT

artificial intelligence, data mining, machine learning, (19 more...)

Country: Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.71)
Information Technology > Data Science > Data Mining > Big Data (0.48)

2e907f44e0a9616314cf3d964d4e3c93-Paper.pdf

Neural Information Processing Systems | Feb-8-2026, 02:05:41 GMT

algorithm, cost vector, opponent, (12 more...)

Country:

Europe > Ireland > Leinster > County Dublin > Dublin (0.14)
Europe > Austria > Vienna (0.14)
Europe > Russia (0.04)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.63)

0c8d3770cbb759430f4f4679abe3ab80-Paper-Conference.pdf

Neural Information Processing Systems | Feb-7-2026, 10:43:35 GMT

algorithm, base learner, learner, (15 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.40)

Online EXP3 Learning in Adversarial Bandits with Delayed Feedback

Neural Information Processing Systems | Dec-25-2025, 21:02:10 GMT

Consider a player that in each of $T$ rounds chooses one of $K$ arms. An adversary chooses the cost of each arm in a bounded interval, and a sequence of feedback delays $\{d_{t}\}$ that are unknown to the player. After picking arm $a_{t}$ at round $t$, the player receives the cost of playing this arm $d_{t}$ rounds later. In cases where $t+d_{t}>T$, this feedback is simply missing. We prove that the EXP3 algorithm (that uses the delayed feedback upon its arrival) achieves a regret of $O\left(\sqrt{\ln K\left(KT+\sum_{t=1}^{T}d_{t}\right)}\right)$. For the case where $\sum_{t=1}^{T}d_{t}$ and $T$ are unknown, we propose a novel doubling trick for online learning with delays and prove that this adaptive EXP3 achieves a regret of $O\left(\sqrt{\ln K\left(K^{2}T+\sum_{t=1}^{T}d_{t}\right)}\right)$. We then consider a two-player zero-sum game where players experience asynchronous delays. We show that even when the delays are large enough that players no longer enjoy the "no-regret property" (e.g., where $d_{t}=O\left(t\log t\right)$), the ergodic average of the strategy profile still converges to the set of Nash equilibria of the game. The result is made possible by choosing an adaptive step size $\eta_{t}$ that is not summable but is square summable, and proving a "weighted regret bound" for this general case.
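
The setting in the abstract above can be sketched in Python as follows: the player samples from exponential weights, and each importance-weighted cost estimate is applied only once its feedback arrives. The function name `exp3_delayed`, the fixed step size `eta`, the 0-indexed rounds, and the `seed` parameter are assumptions of this sketch. The paper's tuned step size and its doubling trick are not reproduced here.

```python
import math
import random


def exp3_delayed(costs, delays, eta, seed=0):
    """Hypothetical sketch of EXP3 with delayed bandit feedback.

    costs[t][a] is the cost in [0, 1] of arm a at round t (0-indexed);
    delays[t] is how many rounds later the feedback of round t arrives.
    Returns the total cost paid and the final cost estimates.
    """
    rng = random.Random(seed)
    T, K = len(costs), len(costs[0])
    est = [0.0] * K   # cumulative importance-weighted cost estimates
    pending = {}      # arrival round -> list of (arm, cost, play probability)
    total = 0.0
    for t in range(T):
        # Exponential weights; subtracting the minimum avoids overflow.
        lo = min(est)
        weights = [math.exp(-eta * (e - lo)) for e in est]
        z = sum(weights)
        probs = [w / z for w in weights]
        arm = rng.choices(range(K), weights=probs)[0]
        total += costs[t][arm]
        arrival = t + delays[t]
        if arrival < T:  # feedback arriving after the horizon is missing
            pending.setdefault(arrival, []).append(
                (arm, costs[t][arm], probs[arm])
            )
        # Apply every piece of feedback that arrives in this round, using
        # the probability with which the arm was played at the time.
        for a, c, q in pending.pop(t, []):
            est[a] += c / q
    return total, est
```

For example, `exp3_delayed([[1.0, 0.0]] * 2000, [5] * 2000, eta=0.05)` should quickly concentrate on the second arm even though every observation is five rounds late. If every delay is at least `T`, no feedback ever arrives and the estimates stay at zero.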

adversarial bandit, name change, online exp3 learning, (3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

A Gang of Adversarial Bandits

Neural Information Processing Systems | Dec-23-2025, 19:06:56 GMT

We consider running multiple instances of multi-armed bandit (MAB) problems in parallel. A main motivation for this study is online recommendation systems, in which each of $N$ users is associated with a MAB problem and the goal is to exploit users' similarity in order to learn users' preferences over $K$ items more efficiently. We consider the adversarial MAB setting, whereby an adversary is free to choose which user and which loss to present to the learner during the learning process. Users are in a social network and the learner is aided by a priori knowledge of the strengths of the social links between all pairs of users. It is assumed that if the social link between two users is strong then they tend to share the same action. The regret is measured relative to an arbitrary function which maps users to actions. The smoothness of the function is captured by a resistance-based dispersion measure $\Psi$. We present two learning algorithms, GABA-I and GABA-II, which exploit the network structure to bias towards functions of low $\Psi$ values.

adversarial bandit, mathcal, name change, (14 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.59)
